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Abstract 

The many successes of synthetic biology have come in a manner largely different from 
those in other engineering disciplines; in particular, without well-characterized and simplified 
prototyping environments to play a role analogous to wind-tunnels in aerodynamics and bread- 
boards in electrical engineering. However, as the complexity of synthetic circuits increases, the 
benefits — in cost savings and design cycle time — of a more traditional engineering approach 
can be significant. We have recently developed an in vitro 'breadboard' prototyping platform 
based on E. coli cell extract that allows biocircuits to operate in an environment consider- 
ably simpler than but functionally similar to in vivo. The simplicity of this system makes 
it a promising tool for rapid biocircuit design and testing, as well as for probing fundamen- 
tal aspects of gene circuit operation normally masked by cellular complexity. In this work 
we characterize the cell-free breadboard using real-time and simultaneous measurements of 
transcriptional and translational activities of a small set of reporter genes and a transcrip- 
tional activation cascade. We determine the effects of promoter strength, gene concentration, 
and nucleoside triphosphate concentration on biocircuit properties, and we isolate the specific 
contributions of essential biomolecular resources — core RNA polymerase and ribosomes — to 
overall performance. Importantly, we show how limits on resources, particularly those involved 
in translation, are manifested as reduced expression in the presence of orthogonal genes that 
serve as additional loads on the system. 

1 Introduction 

The field of synthetic biology has matured to the point where biological parts are regularly assembled 
into modestly complex circuits with wide-ranging applications (i). Unfortunately, the development 
of new biological circuits typically involves long and costly ad hoc design cycles characterized by trial- 
and-error and lacking the prototyping stage essential to other engineering disciplines. More often 
than not, designed circuits fail to operate as expected. The reason for these failures is in many cases 
related to context: the poorly characterized environment in which the system is operating [2— 4)- 
This includes the finite and variable (from cell to cell, condition to condition, and time to time) pools 
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of bio molecular resources such as transcription/translation machinery and nucleoside triphosphates 
(NTPs), weak control over the component DNA concentrations, unpredicted interactions between 
components and circuits and their cellular hosts (5, 6), and any number of other system properties 
with unknown or unknowable effects. 

We have recently developed an in vitro biomolecular 'breadboard' based on E. coli cell extract 
that provides a functional environment similar to in vivo but with significantly reduced complexity 
(7, 8). DNA and mRNA endogenous to the cells is eliminated during extract preparation, so that 
transcription-translation circuits of interest may be operated in isolation without interference by a 
cellular host. The cell-free breadboard also allows for a degree of control over reaction conditions 
and component concentrations that cannot be achieved in vivo. As a prototyping platform, the cell- 
free breadboard provides for a considerable reduction in circuit design cycle time, not only because 
of its relative simplicity when compared with in vivo, but also because it eliminates much of the 
lengthy cloning and cell transformation steps typically required in biocircuit development (9, 10). 
Indeed, cell-free applications for synthetic biology are quickly expanding (11, 12). But beyond its 
potential as an improved circuit development platform, our cell-free breadboard has another signifi- 
cant advantage: its simplicity reveals important details of biocircuit operation normally masked by 
cellular complexity. 

In this work we show a detailed and quantitative characterization of the cell-free breadboard — an 
essential precursor to any biocircuit development and testing application — and explore a number 
of fundamental aspects of biocircuit operation not easily studied in vivo. Central to our work is 
the use of a novel reporter that combines the malachite green RNA aptamer and a fluorescent 
protein for a real-time and simultaneous read-out of the system's transcription and translation 
activity. We establish the functional implications of intrinsic biocircuit properties such as component 
concentration and promoter strength, as well as those of the extrinsic biomolecular resource pool 
that includes nucleoside triphosphates (NTPs) and transcription/translation machinery. Finally, 
through a systematic characterization of the effect of loads on transcriptional and translational 
performance, we show how limits on essential resources, particularly those involved in translation, 
manifest themselves in the form of reduced expression and 'crosstalk' between orthogonal genes. 
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Implications for biocircuit prototyping are discussed. 

2 Results and discussion 

2.1 A combined transcription— translation reporter 

To best characterize transcription- and translation-level performance in the cell-free breadboard 
platform, we use a reporter construct encoding a green or cyan fluorescent protein (deGFP/deCFP) 
along with the malachite green RNA aptamer (MGapt) in the 3' untranslated region (UTR) 
(Fig. 1A). These fluorescent proteins have been previously designed for maximal expression in 
the cell- free system when transcribed from E. coli promoters (7). The 35-base MGapt sequence 
contains a binding pocket for the malachite green dye (13) and allows mRNA dynamics to be easily 
monitored in cell extract with high temporal resolution — as compared to, e.g., radio-labeling and gel 
analysis (14) — and over a wide dynamic range; concentrations of purified deGFP-MGapt transcript 
as low as 37.5 nM and as high as 3 fiM were effectively detected using the MGapt fluorescence 
signal (Fig. SI). (In principle, the upper limit for RNA detection is determined by the amount of 
malachite green dye.) 

Measurements of the decay of deGFP-MGapt transcripts in the cell-free breadboard show that 
degradation is well- described by single exponential decay with 16-18 min half-lives for a wide range 
of concentrations (Fig. S2A). The decay curves for high concentrations of deGFP-MGapt (2 and 

3 fiM) follow those of the lower concentrations only to 15 min, after which the half-lives increase 
dramatically (Fig. S2B). This result is consistent with multimer formation occuring in the highly 
concentrated purified RNA stock (12 /xM); the nucleic acid folding program NUPACK (15) predicts 
20% dimer formation for 4 fiM deGFP-MGapt mRNA and 60% dimer formation for 12 fiM deGFP- 
MGapt mRNA at 37 °C. 

The MGapt fluorescence signal is well-separated from those produced by commonly-used green, 
yellow, and cyan fluorescent proteins. Furthermore, the inclusion of MGapt in the 3' UTR of deGFP 
has little effect on final deGFP levels (Fig. S3), and expression kinetics reported with MGapt are 
consistent with real-time PCR measurements (Fig. S4). Recent studies also have indicated that 
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MGapt is compatible with in vivo characterization (16). Taken together these results suggest 
that deGFP-MGapt is a reliable tool for concurrent measurement of transcription and translation 
activity and a strong addition to the set of similar tools used for real-time monitoring of biocircuit 
performance (cf., (17, 18)). 



FP-MGapt 



TX 





300 400 500 
Time (mins) 



Figure 1: A combined transcription-translation reporter reveals basic features of our cell- free bread- 
board system. (A) The reporter construct encodes an optimized fluorescent protein (FP) along with 
the malachite green RNA aptamer (MGapt) in the 3' UTR. (B) Transcription kinetics reported by 
MGapt for six different template concentrations. Shaded regions indicate standard error over three 
replicates. (C) Translation kinetics reported by deGFP for six different template concentrations. 
Endpoint values represent the total amount of active fluorescent protein produced. 
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2.2 Constitutive gene expression under standard conditions 

In the 'ideal' system with unlimited resources and conditions unchanging with time, the dynamics 
of MGapt and deGFP may be described by a set of ordinary differential equations: 

AMGapt(t) = k TX [DNA] - k deg MGapt (t), (1) 
^deGFP d (t) = k TL MGapt (t) - k mat deGFP d (t), (2) 
^deGFP(t) = k mat deGFP d (t), (3) 

with constants kxx, kd eg , kxi and k ma t and template concentration [DNA], and with deGFPd 
representing the 'dark' (immature) deGFP. (There is no protein degradation term in the model as 
the necessary degradation machinery is absent from the standard cell- free breadboard.) Models of 
this type are common and have the advantage that they may be solved exactly; for example, this 
simple model predicts a steady-state MGapt concentration (= fcyxIpNA]/ kd eg ) and a rate of deGFP 
production that approaches a constant (= [DNA^tx^tl/ kdeg) when k mat t, kd eg t ^> 1. However, 
when the operational demands of a biocircuit outstrip the available resources, or those resources 
exhibit a time- dependent activity, simple models such as this provide little guidance. Precisely how 
resource limits manifest themselves is the focus of the remainder of this work, with the ideal model 
used as a point of comparison. 

With deGFP-MGapt under the control of a strong constitutive promoter, Pr, and 5' UTR 
containing a strong ribosome binding site (RBS), the MGapt expression profiles show clear signs 
of transcriptional resource limits (Fig. IB). Saturation of the transcription machinery occurs as 
template DNA concentration increases; for sufficiently early times, the rates of MGapt production 
are well described by: 



with K ~ 15 nM and V max ~ 90 nM min 1 (Fig. S5). RNA production does not continue indefinitely, 
as evidenced by the absence of a steady-state level of MGapt and, at late times, kinetics that follow 
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a pure exponential decay (Fig. S6A). We find that the MGapt degradation rate constant is not 
the same for all template concentrations, but rather increases with increasing DNA concentrations 
(Fig. S6B). Between 2 and 5 nM DNA template, there is a distinct qualitative change in the MGapt 
profiles, from curves characterized by relatively broad peaks and slow decays to ones that are more 
sharply-peaked. 

The deGFP expression profiles are shown in Fig. 1C. As with the MGapt curves, the time dur- 
ing which protein is produced is limited; however, the precise time t en ^TL past which no additional 
protein is produced is different above and below the 2-5 nM DNA threshold. For all concentrations 
below 2-5 nM, t en ^TL appears fixed at ps 340 minutes. At high concentrations, t en ^TL is substan- 
tially later (at ^500 minutes). These results suggest that the cessation of protein production is 
not likely to be purely due to consumption of resources by the transcription-translation machin- 
ery. Similar results have been noted previously (19) with the suggestion that a number of other 
processes, including NTP hydrolysis and enzyme denaturation, may lead to early termination of 
protein synthesis reactions (20, 21); however, it is worth noting that in our cell-free breadboard 
the protein production time is considerably longer than is typical for other batch-mode cell-free 
reactions (19, 22). 

The deGFP production rates (while t < t en ^TL) also show differences between the high and low 
concentration regimes. The rates could be expected to correlate with the level of MGapt since, in 
the ideal system described by Eqs. (l)-(3), we have that 



Indeed, in the first hour of expression, the time derivative of the (spline fit) deGFP concentration is 
proportional to the MGapt concentration (Fig. S7A). At later times, the behavior of the system is 
no longer ideal and ^deGFP(t) and MGapt (i) diverge, although there remain significant qualitative 
similarities between them (Fig. S7B). One noteworthy difference between deGFP production rates 
and MGapt levels is apparent at low DNA concentrations, where the rate of deGFP production is 
approximately constant for times ~90 mins < t < ~300 mins despite the fact that MGapt is not. 




(5) 
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There is no similar period with a constant deGFP production rate above 5 nM DNA. 

We use two simple metrics to assess system performance under different conditions: the com- 
plete time integral of MGapt concentration (J MGapt) and the endpoint concentration of deGFP 
( [deGFP] en( i). These particular metrics were chosen because they reflect the total transcription- 
and translation-level capacity of the system and are easily comparable between experimental runs. 
For the ideal system we have that: 



as may be seen by integrating Eq. (5) over the full length of the experimental run. (Here we have 
used the fact that the endpoint concentration of immature deGFP is relatively small for late times, 
i.e., [deGFP] d,enc2 *C [deGFP] end .) In the cell-free breadboard, a proportionality relation exists only 
for the first hour of expression below the 2-5 nM DNA threshold, i.e., deGFP(t*) oc J^ 0 MGapt(t)dt 
for t* < 60 mins and [DNA] < 2 nM (Fig. 2A). At higher template concentrations and later times, 
performance deviates from ideality as the chemical make-up of the system is substantially changed 
and the effects of resource limits are more pronounced (Fig. S8). However, it is at this point when 
jMGapt and [deGFP] en d are no longer simply related by Eq. (6) that they may separately provide 
insights into how specific resource limitations are manifested. 

Plotting JMGapt and [deGFP] en ^ as a function of plasmid concentration (Fig. 2B), we clearly 
see the 'linear' regime in which [deGFP] en d is proportional to DNA concentration and a 'saturation' 
regime in which [deGFP] en d versus DNA concentration is sublinear. The existence of a linear regime 
may be easily predicted from the ideal model Eq. (6): 



[deGFP] 



end 



k TL /MGapt, 



(6) 




(7) 
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which, in the limit that krx [DNA] 3> JjMGapt(t), reduces to the simple relation 

[cleGFP] end = [DNA] t end , TL . (9) 

The linear and saturation regimes correspond to the qualitative differences in the MGapt and 
deGFP expression curves described above. Surprisingly, we see no significant change in jMGapt 
at the regime transition point. 




Figure 2: (A) deGFP at various times t* versus MGapt concentration integrated from time t = 0 to 
t = t* for t* = 15, 30, 45, and 60 minutes and a range of DNA template concentrations. (B) Endpoint 
deGFP and integrated MGapt as a function of DNA template concentration. The endpoint deGFP 
level is proportional to the amount of template up to ~2-5 nM. 



2.3 Performance as a function of promoter strength and DNA concen- 
tration 

To establish how promoter strength affects transcription and translation in the cell-free breadboard, 
we tested the reporter construct under the control of two additional constitutive promoters Prl and 
Pr2 made weaker than Pr by single base mutations in the -35 and -10 region, respectively (see 
Materials and Methods). The concentration at which the system transitions from the linear regime 
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to the saturation regime is increased for these weaker promoters, up to ~10 nM for Prl and ~20 
nM for Pr2 (Fig. S9), consistent with other recent work suggesting a maximum capacity between 
16 and 32 nM DNA (9). Thus, we see a performance trade-off between DNA concentration and 
promoter strength: a weaker promoter allows for linear regime performance with higher template 
concentrations. 

The relationship between DNA template concentration, promoter strength, integrated RNA, 
and final protein concentration is not a simple one (Fig. 3A). For example, with 2 nM DNA, 
jMGapt produced using Prl and Pr2 is 40% and 15% of the Pr value, respectively, and [deGFP] en d 
is 11% and 0.4% of that produced by Pr. The percentages are different with 20 nM DNA: JMGapt 
produced using Prl and Pr2 is 70% and 32% of Pr, respectively, and [deGFP] e „d is 30% and 2% 
of Pr. (Separate comparisons of J MGapt and [deGFP] en( 2 for the three promoters can be found in 
Fig. SIO.) 

The dramatic increase in the Pr promoter [deGFP] end -/MGapt curve found at the regime transi- 
tion point may be explained by the differential transcriptional dynamics in the linear and saturation 
regimes — distributed versus peaked — coupled with decreasing activity of the translation machinery. 
A consequence of a time-dependent reduction in translational efficiency, as reported for other cell- 
free systems (19), is that although transcription may take place late in the experiment, the resulting 
mRNAs are less translatable. 

2.4 The role of NTPs 

The standard platform contains the natural NTPs essential for biocircuit operation, in concentra- 
tions of 1.5 mM ATP and GTP and 0.9 mM CTP and UTP (23). Among their many cellular 
functions, ATP and GTP play a crucial role in translation, and all four NTPs are used in transcrip- 
tion as substrates in the synthesis of RNA. NTPs thus serve to couple a biocircuit's transcription 
and translation layers together, with an impact that is not intuitively obvious but that can be 
significant (24)- As a result, understanding precisely how changes in NTP concentration affect 
performance is of paramount importance. 

We supplemented the system with an additional 1.25 mM of each NTP, an increase of ~83% 
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/MGapt (pM-hr) /MGapt (pM-hr) 

Figure 3: (A) Endpoint deGFP versus integrated MGapt for three different promoters (Pr, Prl, 
and Pr2) and with the cell- free breadboard supplemented with 1.25 mM of each of the four NTPs. 
DNA template concentrations are 0.02-20 nM. (B) Comparison of linear regime performance, with 
and without additional NTPs. 

ATP/GTP and ~138% CTP/UTP. In the high DNA concentration (saturation) regime, the addi- 
tional NTPs support a considerable increase in J MGapt but lead to a reduction in [deGFP] enc ; of 
up to 20% (Fig. 3A), a result of a dramatic broadening of the MGapt expression curves and com- 
pression of deGFP curves at late times (Fig. Sll). This suggests that while NTPs may help at the 
transcription level, the excess transcripts are not translatable, and that perhaps the resources used 
to produce those transcripts may have been taken at the expense of reporter protein production. 

In the low DNA concentration (linear) regime, additional NTPs lead to an ~40-50% increase 
in [deGFP] (Fig. 3B). This may be primarily attributed to an increase in the value of t en ^TL 
(to ~450-500 minutes; see Fig. Sll). That is, the rate of production is relatively fixed but the 
productive period is extended. J MGapt also increases at low DNA concentrations. Save for the 
increase in t e nd,TL, the extra NTPs have little effect on the shapes of the MGapt and deGFP profiles 
(Fig. Sll). 
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2.5 Performance of a simple transcription— translation cascade 

We also investigated the effect of adding an intermediate layer of transcription and translation on our 
reporters. The two-stage 'cascade' circuit consists of constitutively-expressed T7 RNAP under the 
control of Pr, Prl, or Pr2 and the deGFP-MGapt construct downstream of a T7-specific promoter 
(Fig. 4). (T7 RNAP is convenient to use for this purpose since, unlike the native E. coli RNAP, it 
is a single-subunit RNAP that is easy to incorporate onto a single plasmid and it does not compete 
with the core RNAP for sigma factors.) If the consumption of NTPs by transcription/translation 
is in fact performance-limiting, we would expect the output of the cascade to be reduced relative 
to constitutive expression since more NTPs are used in its operation. Alternatively, if expression is 
limited (at least in part) by a reduction in the activity of the native RNAP, the introduction of T7 
RNAP may extend the lifetime of the system. 

There are substantial qualitative differences in T7 cascade expression as compared with a single- 
stage constitutive promoter. RNA increases rapidly and exhibits a long, slow decay (Fig. S12), and 
there is a ~30-60 minute delay in protein expression (Fig. S13). A higher reporter DNA concen- 
tration leads to a shorter delay and faster rise in expression, but the final deGFP concentration 
is often below the level achieved with a lower reporter concentration. This suggests a trade-off in 
cascade-driven protein production that may be the result of fuel consumption: if deGFP is produced 
more quickly, then the production appears to arrest sooner. 

The cascade protein output is largely determined by the strength of the promoter that drives 
T7 RNAP production (Fig. 4). Weaker promoters (Prl and Pr2) lead to a wide range of deGFP 
levels with only small variations in the Prl- and Pr2-T7 RNAP plasmid concentrations, and for 
any fixed concentration of the T7 RNAP plasmid, changes in reporter concentration (over an order 
of magnitude) do not affect deGFP output appreciably. When the strong Pr promoter is used, 
deGFP levels saturate at a level independent of the Pr-T7 RNAP concentration while MGapt 
levels vary substantially. The T7 cascade thus provides for independent tuning of RNA and protein 
outputs. There exist regions of overlap where cascades with high concentrations of weaker first-stage 
promoters behave identically to low concentrations of stronger first-stage promoters; for example, 
1 nM Prl-T7 RNAP produces an output ( [deGFP] en ^ and jMGapt) similar to that produced by 
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jMGapt (|jM hr) 

Figure 4: (A) The T7 cascade consists of constitutively-expressed T7 RNAP under the control of Pr, 
Prl, or Pr2 and the deGFP-MGapt construct downstream of a T7-specific promoter. (B) Endpoint 
deGFP versus integrated MGapt for the T7 cascade with Pr-, Prl-, or Pr2-T7 RNAP upstream 
of the PT7-deGFP-MGapt reporter (1, 2, and 10 nM). There exist regions of overlap where high 
concentrations of weaker first-stage promoters perform similarly to low concentrations of stronger 
first-stage promoters. 

0.1 nM Pr-T7 RNAP, and 1 nM Pr2-T7 RNAP lies between 0.1 and 0.2 nM Prl-T7 RNAP. This 
equivalence was not present with the one-stage simple expression and may be due to the fact that 
in all versions of the cascade the promoters driving deGFP are identical. 

In the simple expression case we found that adding NTPs to the system led to a considerable 
increase in transcription in the saturation regime, but that the excess transcripts were not translated. 
We set out to see how the same addition of NTPs affects output of the T7 cascade with a strong 
first-stage promoter. As before, we supplemented the system with an additional 1.25 mM of each 
NTP. The resulting kinetics can be seen in Figs. S14 and S15, and compared with Figs. S12 and S13. 
Again we see a significant increase in the transcriptional activity; peaks are taller and broadened 
and the differences between different PT7-deGFP-MGapt concentrations are more pronounced. And 



13 



Downloaded from http://biorxiv.org/on September 18, 2014 



while we do not see a decrease in deGFP as we did with the Pr-deGFP-MGapt construct, there is 
little to be gained at the translational level by supplying excess NTPs. 

2.6 Resource utilization and crosstalk between orthogonal genes 

Recent studies have revealed that unintended and/or indirect coupling between genes, or 'crosstalk', 
can arise as a side-effect of resource sharing ( 25-31 ) . Resources include those which are consumed 
during circuit operation (e.g., NTPs) as well as those like the transcription/translation machinery 
and other enzymes that are not consumed but are limited and shared across different parts of the 
circuit. To clearly distinguish between crosstalk at the transcription stage (that may arise due to 
competition for RNAP or housekeeping sigma factor) and at the translation stage (arising from, e.g., 
a limited ribosome pool), we used a deCFP-MGapt reporter construct to assay system performance 
in the presence of two different 'loads': (1) the 'native' deGFP, containing the same UTR with 
strong RBS used throughout this work, and (2) deGFP with the RBS deleted from the UTR 
(ARBS-deGFP). (We use the term 'load' to refer to components that consume some of the fixed 
transcriptional and translational capacity of the system, in analogy with electric loads that consume 
electric power.) Reporter and loads are all placed under the control of the same Pr promoter. The 
loads are orthogonal to the reporter in the sense that they have no direct regulatory interaction 
with it, e.g., as activating or repressing transcription factors. With this particular setup, crosstalk 
at the transcription and translation levels appears as changes in MGapt and deCFP fluorescence 
signals, respectively. The use of the Pr-ARBS-deGFP construct guarantees that any crosstalk is 
strictly transcriptional, as no RBS is present to sequester ribosomes away from the production 
of deCFP. In principle, purely translational loads such as high concentrations of purified mRNA 
could be used; however, purified deGFP-MGapt transcripts at high concentration exhibit nonideal 
behavior (as noted earlier; cf. Fig. S2), and the effect of this nonideality on reporter and load output 
is unknown. 

We find that an increase in loading generally leads to a decrease in reporter expression, although 
the effect on transcription is different from that on translation (Fig. 5). For example, a similar 
~250 nM-hr variation in jMGapt is observed for all reporter concentrations as loading increases. 
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At the translation level, the effect is strongly concentration-dependent: the influence of the load on 
[deCFP] en d is small at 1 nM reporter DNA but significant at 10 nM reporter. Thus, as the amount 
of reporter DNA increases, the demand for resources required for production of deCFP increases as 
well and the translational crosstalk becomes much more pronounced. This result underscores the 
fact that there is a maximum translation capacity limiting the total amount of protein that can be 
produced; as [deGFP] em i goes up, [deCFP] en ^ necessarily goes down. The top right corners of the 
plots in Fig. 5 represent a performance regime that appears to be inaccessible. 



reporter = Pr-deCFP-MGapt 
load = Pr-deGFP 
load concentrations: 

• = 1 nM 

■ = 2 nM 

4 = 5nM 

A= 10 nM 



0 0.2 0.4 0.6 0.8 1.0 1.2 0 2 4 6 8 10 12 14 
/MGapt (|jM hr) [deCFP] snd (RFU x10 3 ) 



Figure 5: Effect of expression of load Pr-deGFP on concurrent expression of reporter Pr-deCFP- 
MGapt. Loading is quantified as endpoint deGFP concentration and plotted against integrated 
MGapt (left) and endpoint deCFP (right). Variation in J MGapt with increasing load is similar for 
all reporter concentrations, whereas at the translation level the crosstalk effect is highly dependent 
on load and reporter concentrations. There is a maximum translation capacity to the system that 
limits the total amount of protein that can be produced, as indicated by the inaccessible performance 
regime. 

The loading effects seen in Fig. 5 suggest that translation resources may be more limiting to 
system performance. To confirm this result, we compare the jMGapt-[deCFP] en( i relationships for 
Pr-deGFP and Pr-ARBS-deGFP (Fig. 6). When the RBS is present, we see that in general an 
increase in load leads to a decrease in jMGapt and [deCFP] en( i (Fig. 6, left). When the RBS is 
absent (Fig. 6, right), for the most part the load has no effect on performance, except for at high 
concentrations of both load and reporter, at which point we note a decrease in [deCFP] en< 2 and 
increase m /MGapt. 
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load = Pr-deGFP load = Pr-ARBS-deGFP 



reporter = Pr-deCFP-MGapt 
load concentrations: 

• = 1 nM 

■ = 2 nM 

+ = 5nM 

A= 10 nM 



0 0.2 0.4 0.6 0.8 1.0 1.2 1.4 1.6 0 0.2 0.4 0.6 0.8 1.0 1.2 1.4 1.6 
/MGapt (|jM-hr) /MGapt (|jM-hr) 



Figure 6: Effect of expression of loads Pr-deGFP (left) and Pr-ARBS-deGFP (right) on concurrent 
expression of reporter Pr-deCFP-MGapt. Increasing Pr-deGFP tends to decrease both jMGapt 
and [deCFP] en d. Without the RBS, only when load and reporter concentrations are high do we see 
a crosstalk effect: a decrease in [deCFP] en( 2 and increase in /MGapt. 

2.7 Relevance for in vivo 

While it can be expected that specific circuit behaviors will manifest themselves to different degrees 
in cell-free environments versus in vivo, nevertheless there is significant potential for cell-free work 
to contribute to understanding how circuits function in living systems. One example may be found 
in our 'resource competition' assays, through which we were able to quickly and clearly observe the 
translation machinery serving as a significant limiting resource. It has been suggested — by recent 
theoretical work (29) as well as by several other studies on ribosome utilization (30, 32-34) — that 
similar ribosome loading effects exist in vivo, despite the fact that live cells are able to produce 
additional translation machinery. 

Other consequences of a limited ribosome pool found in the cell-free breadboard may also be 
found in vivo. In particular, it is known that translating ribosomes protect their template mRNAs 
from the action of endonucleases (35) and that ribosome spacing is a determinant of degradation 
rate (36). Thus, if demands on a system are such that the available ribosome pool is insufficient to 
densely cover the number of transcripts, increased endonucleolytic activity would lead to an increase 
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in the RNA degradation rate constant. Consistent with this hypothesis we note an increase in the 
degradation rate constant when the system transitions from the linear to the saturation regime 
(Fig. S6). 

Ribosomes are not, of course, the only molecular resource that may be found in short supply 
and that can, as a result of their limited number, lead to crosstalk between unrelated circuits. 
Examples include the E. coli ClpXP protein degradation machinery (26) and transcription factors 
(TFs) with relatively large numbers of targets (37). (Although the latter theoretical study dealt 
specifically with TF titration, the thermodynamic model may be generalizable to other 'targeting' 
biomolecules.) One particularly interesting class of candidate sources of crosstalk is the RNases. It 
has been suggested that competition for a relatively small number of RNases by a large number 
of RNA molecules can introduce unintended correlations (24), and evidence for this may be found 
in our results: the addition of the untranslated ARBS-deGFP load in amounts higher than or 
comparable to the reporter results in an increase in jMGapt (Fig. 6). In this case, the load 
presents a large number of new targets for degradation enzymes, drawing them away from the RNA 
reporter and thus indirectly leading to the increase in JMGapt. We are currently unaware of in 
vivo results that demonstrate crosstalk via RNases; however, given the ribosome loading effects 
seen in both cell-free and in vivo systems, it is an intriguing possibility worthy of exploration, with 
particular relevance for RNA-based synthetic regulatory circuits (38-40). 

The cell-free breadboard may be useful in predicting or confirming a number of other in vivo 
effects arising due to resource limits. For example, in a recent modeling study it was suggested that 
different combinations of promoter and RBS strengths can result in comparable protein output with 
different loads on the cellular expression machinery, and that codon usage can introduce a bottleneck 
that impacts the expression of other genes (41)- The degree of precise control that exists in the cell- 
free breadboard — for example, control over DNA concentration and known induction levels without 
an intervening cell membrane — makes it an ideal platform for investigating this and other related 
questions. 
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2.8 On biocircuit prototyping 

Despite recent developments in standardized part libraries and rapid assembly tools (e.g., {42, 43)), 
synthetic biology still lacks the accepted prototyping platforms and protocols common to other 
engineering disciplines. For the purposes of prototyping, one particular advantage of our cell-free 
breadboard is the rapid testing cycle it permits: save for the initial cloning, transformation, and 
plasmid preparation, none of the individual assays performed in this work required the many hours 
of cell treatment typically needed for in vivo studies. With plasmids in hand, the time from cell-free 
experiment setup to first results is a matter of minutes. 

Problems associated with limits on the cell-free breadboard system capacity may be mitigated 
when operating in regimes that yield predictable response; for example, in the low DNA concen- 
tration regime where the protein production rate is approximately constant until a well-defined, 
concentration-indepedent end time, and the final protein concentration is proportional to the 
amount of template DNA (Fig. 7). The boundaries of this linear regime can change with pro- 
moter strength and NTP concentration; however, even with the strongest promoter tested, ~6 
hours of predictable performance may be achieved with measurable fluorescent protein signal over 
a wide range of DNA template concentrations. We advocate using the linear regime for cell-free 
circuit testing or other applications that require linear response. The high DNA concentration 
(saturation) regime is best suited for applications when maximum yield is desired but the linearity 
of the DNA-protein relationship is not essential. 

And there are a number of ways in which limits on the capacity of the cell-free breadboard may 
be raised. The functional lifetime of the system, which in bulk operation is limited by unidentified 
mechanisms reducing the activity of the transcription/translation machinery, may be increased 
using dialysis membranes and vesicles, up to 16 and 100 hours, respectively (8, 44)- Reaction 
times may be further extended with the use of microfluidics or other continuous-flow devices, as 
demonstrated with other cell-free environments {45, 4@)- Also, the addition of purified proteins 
such as T7 RNAP or sigma factors could potentially support an increase in capacity at no additional 
cost to the system; in related work it has been shown that purified GamS protein can be added to 
prevent degradation of linear DNA {9). Ideally, a combination of strategies should be employed to 
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Figure 7: DNA concentration and protein production time boundaries of the linear regime, in 
which the rate of protein production is approximately constant and the final protein concentration 
is proportional to the amount of template DNA. 



take maximum advantage of the cell-free breadboard. The ease with which these strategies can be 
employed, along with the relative simplicity of the system and the control that it offers, makes it a 
promising platform for synthetic biocircuit prototyping. 



3 Materials and Methods 
3.1 Cell- free system and reactions 

The breadboard environment consists of a crude cytoplasmic extract from E. coli containing soluble 
proteins, including the entire endogenous transcription-translation machinery and mRNA and pro- 
tein degradation enzymes (7, 8). Detailed instructions for extract preparation can be found in (23). 
To avoid variation between different extract preparations we used the same batch of extract for all 
experiments. Reactions took place in 10 fil volumes at 29°C No significant toxicity was observed 
for typical deGFP expression experiments when up to 20 fiM malachite green dye was included in 
the reaction; the dye concentration was thus fixed at 10 /iM for all experiments. All experiments 
were run for 14 hours, a time found to be sufficiently long for the translational capacity of the 
system to be exhausted under all conditions tested. 
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3.2 Reporters 

Real-time fluorescence monitoring of mRNA dynamics was performed using the malachite green 
aptamer (MGapt) (13) incorporated in the 3' UTR of the fluorescent protein reporter genes, 15 
bases downstream of the stop codons. This location of MGapt insertion was chosen after a number 
of other possible locations were tested and found to give less accurate measures of RNA dynamics. 
For example, incorporation of MGapt within the 5' UTR upstream of the RBS led to decreased 
MGapt fluorescence signal, a result that may be due to the preference of 5' end degradation by 
the dominant endonuclease in E. coli, RNase E (^7). This is consistent with a recent study on the 
Spinach fluorescent RNA aptamer (17) in which it was reported that incorporation of the aptamer 
in the 3' UTR region led to stronger fluorescence than in the 5' UTR. It was also found that a 6- 
base spacing between the stop codon of deGFP and MGapt affected the protein expression level to 
some extent, but a 10-base and 15-base spacing showed equivalent MGapt fluorescence signal levels 
without affecting protein expression. The fluorescent proteins deGFP and deCFP were previously 
designed to be more translatable in the cell- free system (7). The UTR controlling translation of 
deGFP (eGFP-A 6 _22g) and deCFP contained the T7 gene 10 leader sequence for highly efficient 
translation initiation (7). All transcription units included the T500 transcriptional terminator 
except for PT7-deGFP-MGapt which contained T7 terminator. 

Fluorescence measurements were made in triplicate in a Biotek plate reader at 3 minute intervals 
using excitation/emission wavelengths set at 610/650 nm (MGapt), 485/525 nm (deGFP), and 
433/475 nm (deCFP). Error bars in figures showing fluorescence or integrated fluorescence indicate 
standard error over replicates. The reported protein production end times are the times at which the 
protein concentrations reached 95% of their final values, i.e., deGFP (t en d t TL) — 95% x [deGFP] end . 

3.3 Plasmids and Bacterial strains 

Plasmids was created using standard cloning methods. The plasmid pBEST-Luc (Promega) was 
used as a template for all constructs except for the PT7-deGFP-MGapt construct which was derived 
from the plasmid pIVEX2.3d (Roche). The same antibiotic resistance gene was used with each 
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plasmid to ensure that any burden on the system due to the expression of these 'background' 
proteins was the same for each construct. All plasmids used are listed in Table SI. Plasmid DNAs 
used in cell-free experiments were prepared using Qiagen Plasmid Midi prep kits. E. coli strains 
KL740 (which contains lambda repressor to control for Pr promoter) or JM109 were used. LB 
media with 100 /xg/mL carbenicillin was used to culture cells. 

The Pr promoter is that of the lambda phage repressor gene cro. Promoters Prl and Pr2 were 
each modified from Pr with a single base mutation in the -35 and -10 region, respectively. The 
sequences, with mutations highlighted by □, are 

Pr: TGAGCTAACACCGTGCGTGTTGACAATTTTACCTCTGGCGGTGATAATGGTTGCA 



Prl: TGAGCTAACACCGTGCGTGT A GACAATTTTACCTCTGGCGGTGATAATGGTTGCA 



Pr2: TGAGCTAACACCGTGCGTGTTGACAATTTTACCTCTGGCGGTG C TAATGGTTGCA 



3.4 Preparation of pure mRNA and qRT-PCR 

RNA was transcribed using a linear template PCR-amplified from pIVEX2.3d PT7-deGFP-MGapt 
including T7 promoter and T7 terminator region. The transcription reaction was prepared as a total 
volume of 100 /iL with 0.1 /xM linear DNA template, 20% (v/v) T7 RNA polymerase (Cellscript), 

7.5 mM each NTP (Epicentre), 24 mM MgCl 2 (Sigma), 10% (v/v) 10 x transcription buffer, and 
1% (v/v) thermostable inorganic pyrophosphatase (New England Biolabs). After an overnight 
incubation at 37°C, the reaction mixture was run on 1% agarose gel, RNA bands that correpond 
to full-length transcript were excised and eluted from gel by the Freeze-N-Squeeze column (Biorad) 
and resuspended in water. Concentrations of purified RNA were determined spectrophotometrically 
using Nanodrop. 

For qRT-PCR, 1 /iL samples were taken at different time points from a tube containing reaction 
mixture at 29°C and diluted 50-fold in water. These samples were stored at -80°C until used. 
Afterward the samples were further diluted to a final dilution of 1:5000. Two /xL of samples were 
analyzed in 50 /iL reactions of the Power SYBR Green RNA-to-CT 1-Step kit (Life Technologies) 
in the MX3005 real-time PCR machine (Agilent Technologies). Primers amplified a region of the 
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deGFP gene closer to its 3' end (424-597 nt) and were used at 0.3 fiM concentrations. Concentrations 
of deGFP-MGapt RNA in the sample were determined from a standard curve of dilutions of purified 
mRNA in a range from 0.6 to 60 pM mRNA per PCR reaction. 
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Supporting Information 

Alternative performance metrics 

For the purposes of characterization of a particular biocircuit testing environment, in vitro or in vivo, 
there are any number of performance metrics that may be used. We have chosen to use integrated mRNA 
and final protein concentration since they intuitively represent the total transcriptional and translational 
capacity of a system. Other metrics, such as the mRNA and protein production rates and end times, are 
complementary to those used in this work and may be particularly informative depending on the specific 
circuit or system requirement. 

In Figs. S16 and S17 we show the maximum deGFP and MGapt production rates as a function of 
reporter concentration under different conditions. Using these measures, stark differences between simple 
one-stage gene expression and expression from the two-stage T7 cascade can be seen; for example, at low 
reporter concentrations, the maximum production rate of deGFP in the cascade is considerably larger than 
the one-stage rate, even when the strongest promoter is used. The cascade deGFP rates are also relatively 
flat with respect to reporter concentration. There is also a clear effect of NTPs on peak rates when a 
cascade is used versus simple expression. In the former case, additional NTPs provide for a significant 
increase in the maximum protein and mRNA production rates, whereas NTPs have little to no effect on 
maximum production rates for the Pr-deGFP-MGapt construct. 

The deGFP production end time t en ^TL is shown for all conditions in Fig. S18. t e nd,TL is between 
330 and 350 minutes for simple expression in the 'linear' regime, but as high as ~580 minutes in some of 
the conditions and concentrations tested. The end time is a particularly good measure of system capacity 
when extended performance is required. 
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Figure SI: Calibration curve relating concentration of purified deGFP-MGapt transcript to initial MGapt 
fluorescence signal. 




Figure S2: Decay of purified deGFP-MGapt transcripts in the cell-free breadboard. (A) Degradation is 
well-described by single exponential decay with 16-18 min half-lives for a wide range of concentrations. 
(B) At high concentrations, the decay curves follow those of the lower concentrations only to 15 min, after 
which the half-lives increase dramatically. 
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Figure S3: Comparison of total deGFP fluorescence produced by Pr-deGFP and Pr-deGFP-MGapt con- 
structs. The incorporation of MGapt in the 3' UTR downstream of deGFP leads to only a slight increase 
in the amount of protein produced relative to deGFP alone, a result we attribute to an increase in the 
stability of the fusion transcript conferred by the MGapt. 
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Figure S4: Comparison of normalized real-time PCR measurements and MGapt signal for the T7 RNAP 
cascade circuit (described in the section titled "Performance of a simple transcription-translation cascade") 
with 1 nM Pr-T7 RNAP and 5 nM PT7-deGFP-MGapt. Shaded region indicates standard error over 
replicates. 
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Figure S5: MGapt production at early times suggest transcriptional machinery is saturated as DNA 
concentration increases. Fits of MGapt production rates in the first 30 minutes of expression follow a 
Michaelis-Menten form with Michaelis constant K ~15 nM. 
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Figure S6: (A) MGapt expression curves are well-fit by a single exponential at late times. (B) Decay 
constants kdeg determined using a single exponential fit to data after 5 hours of expression, kdeg increases 
with increasing DNA concentration. 
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Figure S7: (A) The time derivative of the (spline fit) deGFP concentration is proportional to the MGapt 
concentration in the first hour of expression. (B) As the experiment goes on and the behavior of the system 
is no longer ideal, the shapes of the 4r deGFP (t) deviate from MGapt (t) (although there remain significant 
qualitative similarities between them; cf. Fig. 1C. 




Figure S8: deGFP at various times t* versus MGapt level integrated from time t = 0 to t = t* for a 
range of Pr-deGFP-MGapt concentrations, with t* = 1, 2, . . . , 14 hrs indicated with O- Below the 'linear'— 
'saturation' regime transition concentration, J*^ Q MGapt appears proportional to deGFP (i*) until protein 
synthesis stops at t* ~ 340 minutes, albeit with a proportionality constant that is different for different 
DNA template concentrations. Above the transition concentration, protein production stops later in the 
experiment and the J^ Q MGapt plateau is relatively short. Dashed lines showing t* = 6 hrs and t* = 10 
hrs are shown for reference. 
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Figure S9: Endpoint deGFP versus plasmid concentration for weak promoters Prl (left) and Pr2 (right). 
Unlike the Pr promoter for which the linear regime ends at 2-5 nM DNA, for Prl and Pr2, the linear 
regime extends up to ~10 nM and ~20 nM, respectively. Dashed line shows a linear fit to the data. 
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Figure S10: Comparison of integrated MGapt (left) and endpoint deGFP (right) for promoters Pr, Prl, 
and Pr2. 
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Figure Sll: MGapt (left) and deGFP (right) expression kinetics when breadboard is supplemented with 
1.25 mM of each of the four NTPs. Shaded regions indicate standard error over replicates. 
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Figure S12: MGapt expression curves for T7 cascade tested with four different concentrations of first- 
stage T7 RNAP plasmid (0.1, 0.2, 0.5, and 1 nM Pr-T7 RNAP) and three different concentrations of the 
second-stage plasmid (1, 2, and 10 nM PT7-deGFP-MGapt). Shaded regions indicate standard error over 
replicates. 
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Figure S13: deGFP expression curves for T7 cascade tested with four different concentrations of the first- 
stage T7 RNAP plasmid (0.1, 0.2, 0.5, and 1 nM Pr-T7 RNAP) and three different concentrations of the 
second-stage plasmid (1, 2, and 10 nM PT7-deGFP-MGapt). Shaded regions indicate standard error over 
replicates. 
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Figure S14: MGapt expression curves for T7 cascade tested with four different concentrations of first-stage 
T7 RNAP plasmid (0.1, 0.2, 0.5, and 1 nM Pr-T7 RNAP), three different concentrations of the second-stage 
plasmid (1, 2, and 10 nM PT7-deGFP-MGapt), and with the cell-free breadboard supplemented with 1.25 
mM of each of the four NTPs. Shaded regions indicate standard error over replicates. 
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Figure S15: deGFP expression curves for T7 cascade tested with four different concentrations of the first- 
stage T7 RNAP plasmid (0.1, 0.2, 0.5, and 1 nM Pr-T7 RNAP), three different concentrations of the second- 
stage plasmid (1, 2, and 10 nM PT7-deGFP-MGapt), and with the cell-free breadboard supplemented with 
1.25 mM of each of the four NTPs. Shaded regions indicate standard error over replicates. 
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Figure S16: Maximum deGFP production rate as a function of reporter concentration under different 
conditions. 
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Figure S17: Maximum MGapt production rate as a function of reporter concentration under different 
conditions. 
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Figure S18: deGFP production end time t en d,TL under different conditions. 
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Supplementary tables 



Table SI: Genotypes of the plasmids used in this study. 



Plasmid name 


Transcription unit (TU) 


TU 

size (bp) 


Backbone/resistance 


pBEST-Pr-GFP 


P R :deGFP:T500 


810 


ColEl/Amp R 


pBEST-Pr-ARBS-GFP 


P R :ARBS-deGFP:T500 


775 


ColEl/Amp R 


pBEST-Pr-GFP-MG 


P R :deGFP-MGapt:T500 


869 


ColEl/Amp R 


pBEST-Prl-GFP-MG 


P R i:deGFP-MGapt:T500 


869 


ColEl/Amp R 


pBEST-Pr2-GFP-MG 


P R2 :deGFP-MGapt:T500 


869 


ColEl/Amp R 


pBEST-Pr-CFP-MG 


P R :deCFP-MGapt:T500 


869 


ColEl/Amp R 


pBEST-Pr-T7RNAP 


P R :T7 RNAP:T500 


3032 


ColEl/Amp R 


pBEST-Prl-T7RNAP 


Pri:T7 RNAP:T500 


3032 


ColEl/Amp R 


pBEST-Pr2-T7RNAP 


P R2 :T7 RNAP:T500 


3032 


ColEl/Amp R 


pIVEX- P T7-GFP-MG 


P T 7:deGFP-MGapt:T7term 


939 


ColEl/Amp R 
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